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© Method of enrichment and cloning for DNA containing an insertion or corresponding to a deletion. 



© The present invention concerns a method for 
enriching for, and optionally, clonally isolating, gene 
sequences which contain an insertion or which cor- 
respond to deletion. The method is capable of such 
enrichment and isolation even when the insertion or 
deletion has not been genetically characterized, or is 
incapable of causing a discernible deficiency. 
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© The present invention concerns a method for 
enriching for, and optionally, clonaily isolating, gene 
sequences which contain an insertion or which cor- 
respond to deletion. The method is capable of such 
enrichment and isolation even when the insertion or 
deletion has not been genetically characterized, or is 
incapable of causing a discernible deficiency. 
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METHOD OF ENRICHMENT AND CLONING FOR DNA CONTAINING AN INSERTION OR CORRESPONDING 

TO A DELETION 



Field of the Invention 

The invention pertains to recombinant DNA 
technology, and, more specifically, to a method for 
enrichment and cloning of a desired nucleic acid 
molecule. 



Background of the Invention 

The ability to identify and clone a particular, 
desired gene sequence from a virus, prokaryote or 
eukaryote is of tremendous significance to molecu- 
lar biology and medicine. Such cloned gene se- 
quences can be used to diagnose the presence of 
a disease or the predilection of a patient to mani- 
fest a disease. Such cloned gene sequences can 
also be used to express a desired gene product 
and therefore can potentially be used for applica- 
tions ranging from catalysis to gene replacement. 
The study of such expressed products can provide 
insight into the pathology and treatment of human 
disease. 

A variety of methods have been developed for 
isolating and cloning desired gene sequences. Ear- 
ly methods permitted only the identification and 
isolation of gene sequences which possessed a 
unique property such as proximity to a prophage 
integration site, capacity for self-replication, distinc- 
tive molecular weight, extreme abundance, etc. 
(The Bacteriophage Lambda , A.D. Hershey, d., 
Coid Spring Harbor Press. Cold Spring Harbor, NY 
(1971); Miller, J.H. Experiments in Molecular Ge- 
netics, Cold Spring Harbor Press, Cold Spring Har- 
bor, NY (1972); Molecular Biology of the Gene , 
Watson, J.D. et aL, (4th ed.) Benjamin/Cummings, 
Menlo Park, CA (1987); Darnell, J. et aL Molecular 
Biology , Scientific American Books, NY, NY (1986)- 
). Because these methods relied upon distinctive 
properties of a gene sequence, they were largely 
(or completely) unsuitable for identifying and clon- 
ing most gene sequences. 

In order to identify desired gene sequences 
which lacked a distinctive property, well character- 
ized genetic systems (such as Escherichia coli , 
Saccharomyces cerevisiae , maize, mammalian 
cells, etc.) have been exploited. In accordance with 
this method, cells are mutagenized (by chemical 
(such as UV light, hydroxylamine, etc. (Miller, J.H. 
Experiments in Molecular Genetics , Cold Spring 
Harbor Press,~Cold Spring Harbor, NY (1972)) or 
genetic means (such as transposon tagging (Davis, 
RW. et al. A Manual for Genetic Enginee ring, 



Press, Cold Spring Harbor, NY (1980)) to produce 
mutants having discernible genetic deficiencies. A 
desired gene sequence is then identified by its 
capacity to complement (i.e. remedy) the genetic 
5 deficiencies of such mutant cells. Such genetic 
identification permitted the genetic characterization 
of the gene sequences, and the construction of 
genetic maps which localized the gene sequence 
to a region of a particular chromosome (Taylor, 
10 Bacteriol. Rev. 34:1 55 (1 970)). 

With the advent of recombinant DNA technol- 
ogies, it became possible to clone (i.e. to phys- 
ically isolate) such genetically characterized gene 
sequences. Random fragments of a genome could 
75 be introduced into self-replicating vectors to pro- 
duce gene libraries, each of whose members con- 
tain a unique DNA fragment (Maniatis, T. et al., Itk 
Molecular Cloning, a Laboratory Manual , Cold 
Spring Harbor Press, Cold Spring Harbor, NY 
20 (1982)). By screening the members of such librar- 
ies for those capable of complementing the defi- 
ciency of a mutant cell, it was possible to clone the 
desired gene sequence. 

Although these methods permit the identifica- 
25 tion and cloning of many gene sequences, they 
may be employed only where a host cell exists 
which has a mutation conferring a discernible defi- 
ciency, and the gene sequence can be cloned into 
a gene sequence delivery system (such as a vec- 
30 tor) capable of entering the host cell and being 
expressed. 

The capacity to physically isolate certain gene 
sequences has led to the development of methods 
which are capable of isolating a desired gene se- 
35 quence even in the absence of mutations or vec- 
tors. 

In one such technique, known as "chromosome 
walking," a desired sequence can be obtained by 
isolating a gene sequence which is capable of 
40 hybridizing to a particular reference sequence. This 
isolated gene sequence is then employed as a 
reference sequence in a subsequent hybridization 
experiment in order to clone a gene sequence 
which is adjacent to, and which partially overlaps, 

45 the originally isolated sequence. This newly iso- 
lated sequence will be physically closer to the 
desired gene sequence that was the originally iso- 
lated sequence. This process is repeated until the 
desired gene sequence has been obtained. As will 

so be appreciated, the ability to clone a gene se- 
quence, in the absence of genetic mutants or vec- 
tors, requires some initial information concerning 
the nucleotide sequence or restriction en- 
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quence. 

Alternatively, the chromosome of a virus or cell 
can be characterized to produce a physical map 
based on either nucleotide sequence or restriction 
endonuciease cleavage data (i.e. an RFLP map). 
Using such a map, restriction fragments of the 
chromosome can be cloned without any prior de- 
termination as to their genetic function. 

More recently, gene cloning has been achieved 
by producing synthesizing oligonucleotide mole- 
cules whose sequence has been deduced from the 
amino acid sequence of an isolated protein, by 
forming cDNA copies of isolated RNA transcripts, 
by differential colony or library subtractive hybrid- 
izations using either two different RNA sources, or 
cDNA and RNA. 

Although these methods may be employed 
even in the absence of either mutants or a gene 
sequence delivery system, they permit a desired 
gene sequence to be identified and cloned only if 
sequences naturally linked to the desired sequence 
have been characterized and isolated, or if the 
sequence or restriction map of such sequences 
has been obtained. Since such data is often un- 
available, these methods are often incapable of use 
in identifying and cloning a desired gene se- 
quence. 

Thus, in summary, the ability to clone DNA 
corresponding to a locus defined only by a muta- 
tion is a relatively simply matter when working with 
E. coli, S. cerevisiae or other organisms in which 
transformation and complementation with genomic 
libraries is feasible. Chromosome walking tech- 
niques may be used in other organisms to clone 
genetically defined loci if the mutant was obtained 
by transposon tagging, if the locus can be linked to 
markers in an RFLP map. or if an ordered library 
for the genome exists. Unfortunately, there are 
numerous organisms in which mutants with inter- 
esting phenotypes have been isolated but for which 
such procedures have not been developed. Thus, 
many gene sequences cannot be isolated using the 
above methods. 

Existing techniques can, in general, only be 
applied to permit the identification and isolation of 
a desired gene sequence if certain additional con- 
ditions are fulfilled. These conditions require the 
existence of either genetic or structural data. In 
view of these constraints, methods capable of per- 
mitting, or facilitating, the identification and cloning 
of gene sequences which cannot easily be isolated 
using current techniques are of great interest. 

Brief Description of the Figure 

Figure 1 is a diagramm of the enrichment 
and cloning method of the preferred embodiment 



of the present invention. DNA is depicted as a solid 
line; biotinylated DNA is depicted as a striped 
. black/white line; Sau3a adaptors are shown as an 
open line; avidin beads are shown as speckled 
5 circles; radiolabelled fragments are shown with as- 
terisks. 

Summary of the Invention 

io A technique, called "genomic subtraction," has 

been developed for isolating the DNA that is absent 
in deletion mutants. The method removes from 
wild-type DNA all of the sequences that are 
present in both the wild-type and the deletion mut- 
75 ants genomes. The DNA that remains is derived 
from the deleted region. The method of genomic 
subtraction has been exemplified by cloning DNA 
from a yeast strain corresponding to a 5 kb yeast 
DNA deletion. In brief, the method achieves an 
20 enrichment for the deleted sequences by allowing 
wild-type DNA to reassociate with an excess of 
biotinylated deletion mutant DNA. After reassocia- 
tion, the biotinylated sequences are removed by 
binding to avidin coated beads. This subtraction 
25 process was then repeated several times. In each 
cycle the unbound wild-type DNA from the pre- 
vious round was hybridized with fresh biotinylated 
deletion mutant DNA. Samples of unbound DNA 
from each cycle were capped with adaptors (such 
30 as Sau3A adapters) and amplified by using one 
strand of the adaptor as a primer in the polymerase 
chain reaction. Replicas of a wild-type yeast 
genomic library were probed with the labelled am- 
plified products. All of the genomic clones that 
35 hybridized to the labelled DNA from the third round 
of subtraction were found to contain sequences 
that are deleted in the mutant. 

The present invention thus concerns a method 
for enriching for, and optionally clonally isolating, 
40 gene sequences which contain an insertion or 
which correspond to deletion. Of special concern is 
a method capable of such enrichment and isolation 
even when the insertion or deletion has not been 
genetically characterized, or is incapable of caus- 
45 ing a discernible deficiency. 

In detail, the invention provides a method for 
enriching for a desired gene sequence absent in 
one population of nucleic acid molecules but 
present in another which comprises: 
so a) mixing a first population of denatured nu- 

cleic acid molecules suspected of possessing the 
desired gene sequence with an excess amount of a 
second population of denatured nucleic acid mol- 
ecules suspected of lacking the desired gene se- 
55 quence, wherein the second population of mol- 
ecules has been labelled with a label, and wherein 
the first population of molecules lacks the label; 

b) permitting the mixed molecules of the first 
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and second populations of molecules to hybridize 
and thereby form double-stranded nucleic acid 
molecules; and 

c) isolating any double-stranded molecules 
of the mixture which are composed solely of re- 
natured molecules of the first population. 

The invention further concerns the above-de- 
scribed methods wherein step (c) comprises the 
sub-steps: 

1. incubating the mixed first and second 
populations of molecules with an agent capable of 
binding to the label of the second population of 
molecules; 

2. permitting the agent to bind to the label of 
the second population of molecules; and 

3. separating any of the second population 
of molecules from the mixture by selectively sepa- 
rating the bound agent, along with its bound, label- 
led, molecule of the second population of mol- 
ecules, from the mixture. 

The invention further concerns the above-de- 
scribed methods wherein the label of the labelled 
second population of molecules is a biotin label, or 
an antibody label. 

The invention further concerns the above-de- 
scribed methods wherein the agent is an antibody 
or avidin, either in solution, or bound (as to a bead 
or resin). 

The invention also concerns the above-de- 
scribed methods wherein the avidin in solution, 
after binding to the labelled molecule of the second 
population of molecules, is removed from solution 
along with the bound, labelled, molecule by affinity 
separation with biotin, or by immunoaffinity separa- 
tion with an anti-avidin antibody. 



Description of the Preferred Embodiments 

The present invention concerns a method for 
enriching for, and clonally isolating a gene se- 
quence present in one nucleic acid population but 
absent in another. 

As used herein, the term "gene sequence" is 
intended to refer to a nucleic acid molecule which 
may be either DNA or RNA. The preferred Nucleic 
acid molecule composing the gene sequence will 
be a DNA molecule. The gene sequence may be of 
any length, and may possess any nucleotide se- 
quence. The gene sequence may be capable of 
expressing a gene product (i.e. it may possess 
sufficient regulatory sequences to enable its tran- 
scription into RNA and its translation into a protein 
or polypeptide), or, alternatively, it may be incapa- 
ble of such expression. The term "gene sequence" 
is further intended to include sequences which are 
non-transcribed (such as regulatory sequences), 



quences), or which encode genes or gene frag- 
ments. A "gene sequence" may contain more than 
one such sequence. The gene sequences of the 
present invention may be viral, prokaryotic or 
5 eukaryotic, or may be synthetic or recombinant. 

The desired gene sequences of concern to the 
present invention are "present in one nucleic acid 
population but absent in another." 

Such a desired gene sequence may comprise 
w an insertion which is present in one sequence but 
absent in another. Examples of such an insertion 
include an integrated virus, gene duplication, 
mutational insertion, transposon insertion, etc. Al- 
ternatively, such a desired gene sequence may 
75 comprise an extrachromosomai element such as a 
plasmid, episome, virus, etc. which is present in 
one source but absent from another. Such a de- 
sired gene sequence may alternatively comprise a 
sequence which is incapable of hybridizing with a 
20 second sequence (such as by being highly diver- 
gent with respect to the second sequence). The 
desired gene sequence may alternatively comprise 
a sequence which corresponds to a deletion that is 
present in a particular reference sequence. The 
25 term "deletion" is intended to refer to the absence 
of a particular sequence of nucleotides from a 
particular DNA or RNA molecule which normally 
and naturally contains the sequence. 

A nucleic acid sequence is said to "have a 
30 deletion" if, when compared to a second nucleic 
acid sequence, it is found to lack at least one 
nucleotide which is present in that sequence. For 
example, if one nucleic acid sequence contains the 
sequence " ATG CGTAA," and a second sequence 
35 contains the sequence "ATGAA," then the second 
sequence is said to have a deletion of the se- 
quence "CGT." The deleted nucleotides ("CGT") 
being between " ATG " and "AA." 

A nucleic acid sequence is said "to have or to 
40 carry an insertion" or "to correspond to the dele- 
tion" if it contains a sequence which corresponds 
to all or some of the sequence of nucleotides which 
is absent from the sequence which has the dele- 
tion. Thus, referring to the above example, the 
45 sequence "CGT" corresponds to the sequence de- 
leted in the sequence " ATGAA ." 

The expressions "contain an insertion" or 
"correspond to a deletion" are intended to be 
equivalent, and to reflect merely whether ones 
so bases ones reference point on the molecule con- 
taining the deletion or on the molecule which cor- 
responds to the deletion. Thus, for example, if a 
particular sequence, normally present in a nucleic 
acid molecule, is absent from the nucleic acid 
55 molecule of a mutant, then the mutant is said to 
"have or carry" a deletion of these sequences. The 
present invention permits one to enrich for and 

^l^nollw icrtlato s» ni iHoip apiH mnlfln ilo whiph Pnr- 



EP 0 372 524 A2 



8 



responds to this deletion. 

Similarly, if a particular sequence is normally 
not present in a nucleic acid molecule (such as the 
sequence of an integrated virus), but is present in 
the nucleic acid molecule of a mutant, then the 
mutant is said to "have or carry" an insertion of 
these sequences (i.e. to have or carry a sequence 
which corresponds to a "deletion" in the non-mut- 
ant). The present invention permits one to enrich 
for and clonaliy isolate a nucleic acid molecule 
which contains this insertion (i.e. which corre- 
sponds to this "deletion"). 

The methods of the present invention are 
equally applicable regardless of whether the dele- 
tion results from the loss of a nucleotide sequence 
from a reference nucleic acid molecule or whether 
it reflects an insertion in one nucieic acid molecule 
which is not present in a reference nucleic acid 
molecule. Importantly, therefore, the term 
"deletion" denotes only that a comparison has 
been made between two nucleic acid sequences, 
and that one sequence has been found to lack a 
sequence found in the other. As used herein, the 
sequence which is "lacked" by one molecule is 
known as the deletion. The sequence in the other 
molecule which corresponds to the deletion may 
be equivalently either an insertion or a sequence 
which "corresponds to the deletion." 

One object of the present invention is to 
achieve the "enrichment" of gene sequences which 
correspond to a particular deletion (or which con- 
tain an insertion). As used herein, the term 
"enrichment" is intended to refer to an increase in 
the concentration of a particular gene sequence 
with respect to the total concentration of gene 
sequences in a sample. 

Once a desired gene sequence has been en- 
riched for. it is possible to insert the enriched gene 
sequence into a suitable viral or plasmid vector and 
thereby obtain the clonal isolation of the enriched 
gene sequence. Alternatively, the sequence can be 
labeled and used to probe other sequences. Meth- 
ods which can be used to accomplish such inser- 
tion are disclosed, for example, by Maniatis, T. et 
al. (In: Molecular Cloning, a Laboratory Manual , 
ColdHspring Harbor Press, Cold Spring Harbor, NY 
(1982)). 

I. Methods of Deletion/Insertion Cloning 

Two general approaches have been described 
for cloning sequences that are present in one strain 
and absent in another. The first approach, differen- 
tial screening, has been used to clone the esc 
gene from Drosophila . Using genomic DNA from 
strains with and without a deletion to probe replicas 
of a genomic library poses technical difficulties 



which become daunting for large genomes. In addi- 
tion, the deletion must cover at least one entire 
insert in the genomic library which does not con- 
tain any repeated sequences. 

5 The second approach, competitive hybridiza- 

tion, provides an elegant alternative to differential 
screening. This technique was used by Lamar et al. 
(Cell 37:171-177 (1984)) to isolate clones specific 
for the human Y chromosome. In accordance with 

w this method, an excess of sheared DNA from a 
human female is denatured and reannealed along 
with a small amount of DNA from a male (the male- 
derived DNA having been previously treated to 
have sticky ends). Most of the male DNA reas- 

?5 sociated with the sheared DNA yielding unclonabie 
fragments lacking sticky ends. Fragments unique to 
the Y chromosome, however, could only reas- 
sociate with the complementary restricted strand 
(derived from the Y chromosome). Such reassocia- 

20 tion thus formed clonable fragments with sticky 
ends. This technique has also been used success- 
fully to clone DNA corresponding to deletions in 
the Duchenne muscular dystrophy locus, and 
choroideremia. 

25 Unfortunately, the competitive hybridization 

method does not provide a large enough degree of 
enrichment. For example, enrichments of about one 
hundred fold were obtained for the sequences of 
interest in the above experiments. With enrich- 

30 ments of such low magnitude, the technique is 
practical only when dealing with large deletions. 
Indeed, even if the deletion covered 0.1% of the 
genome, many putative positive clones have to be 
tested individually by labeling and probing genomic 

35 Southern blots (Southern, J., J. Molec. Biol. 98:503- 
517 (1975)). The method as it stands, then, is not 
practical for deletions on the order of 1 kbp 
(kilobasepair) unless one is dealing with a small 
prokaryotic genome. 

40 

II. THE DELETION/INSERTION CLONING METH- 
OD OF THE PRESENT INVENTION 

45 The present invention achieves the enrichment 

of gene sequences which correspond to a deletion 
using iterative rounds of subtractive hybridization to 
enrich for sequences absent in one population of 
nucleic acid molecules but present in another. The 

so method is diagrammed in Figure 1 . 

In accordance with the methods of the present 
invention, two sources of nucleic acid are required. 
The first source contains a nucleic acid molecule 
which lacks a sequence present in the nucleic acid 

55 of a second source. The sequence which is absent 
from the first source (the source which "contains a 
deletion") but which is present in the second 
source (the source which "contains a sequence 
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which corresponds to the deletion") is the "desired 
sequence" whose enrichment and cloning is the 
object of the methods of the present invention. 

The deletion carried in the nucleic acid of the 
first source can be induced artificially, as through 
the use of mutagens or transposable elements 
(Miller, J.H. Experiments in Molecular Genetics , 
Cold Spring Harbor Press, Cold Spring Harbor, NY 
(1972); Davis, RW. et at. A Manual for Genetic 
Engineering Advanced Bacterial Genetics , Cold 
Spring Harbor Press, Cold Spring Harbor, NY 
(1980)), etc., alternatively, the deletion may arise 
naturally during evolution, and thus reflect the ge- 
netic relatedness (or lack thereof) between the two 
sources. 

The nucleic acid of the two sources is ex- 
tracted and mixtures of nucleic acid molecules are 
purified from other cellular components. For this 
purpose, one may follow the procedures of Man- 
iatis, T. et al., In: Molecular Cloning, a Laboratory 
Manual , "Cold Spring Harbor Press, Cold Spring 
Harbor, NY (1982)). 

The nucleic acid molecule which carries the 
deletion may be either DNA or RNA. The nucleic 
acid molecule which corresponds to the deletion 
may also be either DNA or RNA. However, due to 
the desirability of amplifying this molecule after its 
enrichment, it is preferable for the nucleic acid 
molecule which corresponds to the deletion to be a 
DNA molecule. The most preferred embodiment of 
the present invention employs DNA sources for 
both the deletion gene sequence and the desired 
gene sequence. In the following discussion of the 
invention, the invention will be described in terms 
of this preferred embodiment. 

Once a nucleic acid sample has been obtained 
which contains the deletion gene sequence, this 
sample is treated to decrease the size of the nu- 
cleic acid molecules which it contains. For this 
purpose, one or more restriction endonucleases (if 
the sample is DNA) are most preferably employed. 
The most preferred restriction endonuclease is 
Sau3a, although other restriction endonucleases 
can be employed. It is desirable for the nucleic 
acid molecule to be cleaved into fragments which 
range from 0.1 -100 kb in size, and more prefer- 
ably, 0.1 - 10 kb in size. For a deletion of approxi- 
mately 5 kb, the DNA is preferably reduced to 
approximately 3 kb in size. Methods for cleaving 
DNA molecules with restriction endonucleases are 
provided by Maniatis, T. et al.. In: Molecular Clon- 
ing, a Laboratory Manual . Cold Spring Harbor 
Press? Cold Spring Harbor, NY (1982)). 

The sample which contains the nucleic acid 
molecule that carries the deletion is sheared, using 
conventional means, such that the average length 
of the nucleic acid molecules is similar to that of 



this sample is modified, preferably with biotin, such 
that the molecules will become able to bind to free 
or immobilizable avidin (such as an avidin bead or 
column (U.S. Patent No. 4,228,237 (Hevey et al.); 
5 Hofman et ah, J. Amer. Chem. Soc. 100:3585-3590 
(1978)). The modification is most preferably ac- 
complished using photobiotin (Vector Laboratories). 
Other methods can, of course, be employed to 
produce such biotinylated molecules. 
70 Once the above procedures have been accom- 

plished, the two treated nucleic acid samples are 
mixed and incubated under conditions sufficient to 
cause denaturation of any double stranded mol- 
ecules. It is, alternatively, possible to denature the 
75 separated preparations, and then to mix the single- 
stranded preparations together. The nucleic acid 
samples are to be mixed together such that the 
concentration of the nucleic acid molecule which 
carries the deletion will be in excess over the 
20 concentration of the nucleic acid molecule which 
corresponds to the deletion. 

Once mixing/denaturation has been accom- 
plished, the mixed sample is cooled and incubated 
under conditions sufficient to permit hybridization 
25 and the reassociation of complementary strands. 
The conditions of such hybridization can be con- 
trolled to enhance or optimize the rate of hybridiza- 
tion. For example, high concentrations of salts such 
as 1 M NaCI, Dextran Sulfate, 4 M Lithium Chlo- 
30 ride, 2 M Ammonium Sulfate, etc. may be used. 
Such incubation will result in the production of five 
classes of nucleic acid molecules: 

(1 ) Double-stranded molecules both of 
whose strands are biotinylated/ Such molecules 

35 result from the reannealing of two strands both 
derived from the sample containing the nucleic 
acid molecule which carries the deletion. Since 
both stands are biotinylated. and thus derived from 
the source which carried the deletion, no member 

40 of this class of molecules can contain the desired 
gene sequence (which corresponds to the deletion 
and therefore must come from the other source). 

(2) Double-stranded molecules having only 
one biotinylated strand. Such molecules result from 

45 the reannealing of a strand derived from the sam- 
ple containing the nucleic acid molecule which 
corresponds to the deletion, and a strand derived 
from nucleic acid molecule of the sample which 
contained the deletion. Since such molecules are 

so hybrids of both sources of nucleic acid, all mem- 
bers of this class of molecules contain sequences 
which are unrelated to the deletion sequence (since 
that sequence is not present in the biotinylated 
nucleic acid source). 

55 (3) Single-stranded biotinylated molecules. 

Such molecules result from the failure of a nucleic 
acid molecule derived from the deletion containing 
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class of molecules is derived from the sample 
containing the nucleic acid molecuie which carries 
the deletion, no member of this class of molecules 
can contain the desired gene sequence. 

(4) Singie-stranded non-biotinylated mole- 
cules. Such molecules result from the failure of a 
nucleic acid molecule derived from the source con- 
taining a nucleic acid molecule which corresponds 
to the deletion to reanneal to another strand. Since 
this class of molecules is derived from the sample 
containing the desired gene sequence, members of 
this class of molecules potentially contain the de- 
sired gene sequence. 

(5) Double-stranded molecules neither of 
whose strands are biotinylated. Such molecules 
result from the reannealing of two strands both 
derived from the sample containing the nucleic 
acid molecule which corresponds to the deletion. 
Since this class of molecules is derived from the 
sample containing the desired gene sequence, 
members of this class of molecules potentially con- 
tain the desired gene sequence. Members of this 
class of molecules are especially desirable since 
their double-stranded nature permits them to be 
incorporated into cloning vectors. 

As will readily be perceived in light of the 
above discussion, the number of molecules which 
fall into classes 4 and 5 will decrease as more of 
the biotinylated, deletion-containing nucleic acid is 
added to the mixture. Thus by having such 
biotinylated DNA in excess, the amount of class 4 
or class 5 molecules (relative to the amounts of 
class 1 , 2, and 3 molecules) will be low. 

Despite the fact that the unwanted molecules 
of classes 1, 2, and 3 will be present in great 
excess over the desired molecules of classes 4 
and 5, it is possible to easily remove the unwanted 
molecules from the mixture by virtue of the fact 
that the molecules of classes 1, 2 and 3 are 
biotinylated. 

Such biotinylated molecules, whether hybridiz- 
ed to a complementary biotinylated molecule (class 
1 ), hybridized to a complementary non-biotinylated 
molecule (class 2), or single-stranded (class 3), can 
be removed from the sample in any of a variety of 
ways. Such molecules can be incubated in the 
presence of free avidin in solution or suspension, 
under conditions sufficient to permit the avidin to 
bind to the biotin of the biotinylated molecules. The 
preparation may then be incubated in the presence 
of bound biotin (for example, biotinylated beads, 
resins, columns, etc.) under conditions sufficient to 
permit the biotinylated nucleic acid bound-avidin to 
bind to the biotin of the column (avidin possesses 
multiple biotin binding sites, and can, thus, simulta- 
neously bind to the biotinylated nucleic acid mol- 
ecule and to the biotinylated bead or resin). An 
immobilized avidin column is most preferably used 



for this purpose. In. lieu of an avidin column, it is 
possible to employ other methods for removing the 
biotinylated molecules from the mixture. For exam- 
ple, one could employ a streptavidin-cellulose ma* 

5 trix, avidin-agarose or avidin coupled to magnetic 
beads, etc. Unbound molecules of classes 4 and 5 
can then be recovered from the etuate of the 
biotinylated column, or from the supernatant of the 
centrifuged biotinylated bead, etc. 

ig In an alternative, preferred, method, the 

biotinylated molecules of classes 1-3 are incubated 
in the presence of avidin beads, resins, etc. (i.e. an 
immobiiizabte or immobilized avidin). For this pur- 
pose, avidin coated polystyrene beads are espe- 

75 cially preferred. The biotinylated nucleic acid mol- 
ecules and the avidin are incubated under con- 
ditions which permit the biotinylated molecules to 
bind to the avidin. The avidin-bound molecules are 
then removed from solution by centrifugation, sedi- 

20 mentation, affinity chromatography, or more prefer- 
ably, by microcentrifuge filtration, etc., to produce a 
solution containing only the desired class 4 and s 
molecules. 

As will be readily appreciated in view of the 
25 foregoing discussion, the present invention does 
not require the use of biotin and avidin. Such 
reagents are employed in the most preferred em- 
bodiment of the invention. Any pair of reagents 
capable of specific binding to one another can be 
30 used provided that one of the reagents can be 
Hnked to a nucleic acid molecule and that such a 
linked molecule will not be incapable of hybridizing 
with a complementary nucleic acid molecule. Ex- 
amples of such alternative agents include modify- 
35 ing the nucleic acid molecules with an antigen (for 
example, digoxigenin) and purifying the undesired 
molecules from the mixture with an antigen specific 
antibody (such as antidigoxigenin antibody). 

The direct effect of the affinity chromatography 
40 will be to enrich for those gene sequences which 
correspond to the sequence of the deletion. The 
nucleic acid obtained as the "flow-through" of the 
affinity chromatography, or as the solution resulting 
from the removal of the biotinylated nucleic acid 
45 bound-avidin beads, does not contain any biotiny 
lated gene sequences. Such nucleic acid can be 
recovered and repeatedly cycled (using additional 
biotinylated, deletion-containing nucleic acid mol- 
ecules. Such repeated cycles are capable of exten- 
so sively enriching the concentration of a desired 
gene sequence. 

In general, 3-5 such cycles are conducted in 
order to enrich a particular 5 kb yeast sequence to 
homogeneity. Since the yeast genome contains 
55 approximately 105 Sau3a fragments, and a 5 kb 
fragment would be expected to have approximately 
20 Sau3a sites, the methods of the present inven- 
tion, which are capable of isolating fragments con- 
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taining each of these 20 sites, provides an enrich- 
ment of several thousand fold. 

Once a sufficient degree of enrichment has 
been obtained, the enriched gene sequence can be 
introduced into a suitable cloning vector. The effi- 
ciency of cloning can be substantially enhanced if 
the enriched gene sequence is amplified prior to 
cloning. Although any method of amplification can 
be used, it is most preferable to amplify the gene 
sequences using the polymerase chain reaction 
described by Mull is in U.S. Patent No. 4,683,195, 
and by Saiki et aL ( Science 239 : 487-491 (1988)), 
which references are incorporated herein by refer- 
ence. In a preferred embodiment, the fragments 
are "capped" by the ligation of adaptors that have 
sticky ends. Adapters having a Sau3A cleaved ter- 
minus comprise the preferred sticky end adapter. 
Alternatively, any linkers or adaptors may be used. 
If desired, such linkers or adaptors may be added 
to the termini of the molecules at any stage, even 
prior to the first hybridization. The ligated frag- 
ments can them be amplified by using a Taq 
polymerase and an excess of one adaptor strand 
which serves as a primer. Capping with adaptors 
and amplification is a general strategy for am- 
plifying fragments of unknown sequence. Radioac- 
tive nucleotides are incorporated into the amplified 
DNA in a final round of polymerization. The label- 
led DNA is used to screen a genomic library made 
from wild-type DNA. The most intense signals 
should correspond to colonies that contain DNA 
coinciding with the sequences that are deleted in 
the mutant. 

Once the DNA has been amplified, either by 
cloning or by the polymerase chain reaction, it can 
be used to screen any sort of library (i.e., cDNA, 
etc.) or blot (Northern or Southern). The amplified 
DNA can be cloned and used to screen blots or 
libraries. It is possible to use such a cloned am- 
plified library with labeled amplified DNA in order 
to target clones that correspond to enriched se- 
quences in the probe. 

The concept of using subtractive hybridization 
was used to isolate mRNAs corresponding to DNA 
that had been deleted from a bacteriophage T4 
deletion mutant (Bautz et aL. Science 151 :328-330 
(1966). Several laboratories have recently pub- 
lished methods for enrichment of deleted DNA 
sequences from genomic DNA (Lamar et aL, Cell 
37:171-177 (1984); Welcher et aL, NucL Acids Res. 
14: 10027- 10044 (1986); Bjourson et aL, AppL 
Environ. Microbiol. 54:2852-2855 (1988)). The 
genomic subtraction procedure described herein 
makes it possible to achieve much greater enrich- 
ments than was possible using such methods. En- 
richments obtained by single step competitive hy- 
bridization techniques such as the one developed 



177 (1984)) can be no greater than the ratio of 
deletion mutant to wild-type DNA. If a twenty fold 
excess of deletion mutant DNA is hybridized with 
wild-type DNA, one twentieth of all of the wild-type 
5 DNA strands will reassociate with another wild-type 
strand. This problem, which is a factor in any 
enrichment scheme that uses double stranded 
DNA, can be overcome either by using a great 
excess of deletion mutant DNA or by employing a 
io iterative approach as we have done here. Subtrac- 
tion protocols using single stranded cDNA and 
mRNA (Timberlake, DeveL BioL 78:497-510 (1980)) 
do not require these measures since strands within 
the two nucleic acid populations cannot anneal to 
rs each other. Actual enrichments using the single 
step method have been less than 100 fold (Lamar 
et aL, Cell 37:171-177 (1984); Kunkei et aL, Proc. 
NatTAcad. Sci. USA 82:4778-4782 (1985); Nuss- 
baum et aL, Proc. Natl. Acad. Sci. USA 84:6521- 
20 6525 (1987)). Two previous methods that employ 
multiple rounds of subtraction have resulted in the 
purification of sequences occurring in one strain of 
bacteria but not in another (Welcher et aL, Nucl. 
Acids Res. 14:10027-10044 (1986); Bjourson et aL, 
25 AppL Environ. Microbiol. 54:2852-2855 (1988)). The 
enrichment factors required to achieve purification 
of the bacterial fragments were much smaller 
(about 200 fold smaller in reference 14) than in the 
experiment reported here. It is difficult to simulta- 
30 neously obtain both impressive enrichments and 
useful amounts of DNA using techniques that lack 
an amplification step. 

Thus, combining DNA subtraction with an am- 
plification step allows attainment of large enrich- 
35 ment factors while minimizing the amount of DNA 
required. The methods of the present invention 
preferably accomplish such amplification by incor- 
porating the polymerase chain reaction to amplify 
the DNA after the final round of subtraction. The 
40 polymerase chain reaction is described by Mullis in 
U. S. Patent No. 4,683,195, which reference is 
incorporated herein by reference. 

Using the amplified DNA to probe a library 
greatly reduces screening time compared to 
45 screening Southern blots with individual clones. 
The signal generated by a particular sequence in 
the probe is proportional to its fractional repre- 
sentation in the probe. Thus, the fragments in the 
probe that are deleted in the mutant need not be 
so purified in order to pinpoint the corresponding 
clones in a library. The desired fragments must 
only be more highly represented in the probe than 
any other particular fragment. 

High enrichment factors depend on minimizing 
55 the accumulation of amplifiabte contaminating spe- 
cies in the unbound DNA fractions. Two classes of 
fragments are expected to contaminate the un- 
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class is composed of reassociated wild-type 
strands that do not come from the deleted region, 
sequences that are single stranded because of 
insufficient reassociation times, and sequences that 
have annealed to mutant DNA that did not stick to 
the avidin coated beads. Possible non-random con- 
taminants are fragments that can not hybridize to 
biotinylated DNA during subtraction (e.g. snap-back 
sequences or fragments with T m 's below the hy- 
bridization temperature), those that correspond to a 
deletion at another locus, repetitive sequences that 
are much more numerous in the wild-type strain 
than in the deletion strain (in this case the 
biotinylated sequences may not be in sufficient 
excess), and DNA from organisms contaminating 
the wild-type strand. 



ill. USES OF THE METHODS OF THE PRESENT 
INVENTION 

The methods of the present invention can be 
used to generate a collection of restriction frag- 
ments from one strain that do not occur in another 
strain. These strains may differ in that one is dis- 
tantly related to the other or in that one has been 
mutagenized with agents that cause high rates of 
deletion mutants. Oligonucleotides are synthesized 
that correspond in sequence to each strand of each 
such fragment. These oligonucleotides are pooled. 
DNA is made from a small sample of tissue and 
combined with the pooled oligonucleotides and 
then amplified using the polymerase chain reaction. 
If the tissue came from, or contained, the strain 
lacking the restriction fragments, no bands are ap- 
parent on an ethidium stained gel after amplifica- 
tion. However, if the tissue came from, or con- 
tained, the strain with the restriction fragments, one 
band appears corresponding to each fragment. The 
two strains can be crossed and the F1 progeny 
mated. The resulting F2 organisms are analyzed by 
amplification of DNA using the oligonucleotide 
primers followed by gel electrophoresis. The co- 
segregation of the fragments can be scored gen- 
erating a linkage map of these fragments. 

This approach simplifies the mapping of new 
genetic markers. The mutant, obtained in the strain 
containing the fragments, is crossed with the strain 
lacking the fragments and then the progeny (F1) 
mated with a sibling. The progeny of this cross 
(F2) are then scored for phenotype. DNA is made 
from a small sample of tissue and amplified using 
the pooled probes from the previous paragraph. 
The amplified products are scored for the presence 
and absence of the various fragments on a gel. In 
this way the phenotypic marker can be linked ge- 
netically to restriction fragment markers. 

Similarly, phylogenetic trees can be construct- 



ed by isolating fragments that occur in one or- 
ganism but that are absent (or have diverged so 
that they do not hybridize under the conditions 
used in the subtraction protocol) in a distantly 
5 related organism. Oligonucleotides are synthesized 
corresponding to such fragments. Tissue from re- 
lated organisms could then be scored for related- 
ness to the two original organisms by simply am- 
plifying a small amount of DNA using the 
io oligonucleotides as primers. A phylogenetic tree 
can be constructed from the pattern of bands ob- 
served on a gel. 

Similarly, a collection of human fragments can 
be obtained using this technique that are present in 
75 subsets of human populations. Oligonucleotides 
can be synthesized corresponding to each strand 
of each such fragment. DNA from a single cell 
forensic sample (such as a hair follicle) can be 
amplified using these pooled oligonucleotides as 
20 primers. The resulting band pattern is comparable 
to a fingerprint of the individual, and can be used in 
forensics in the same manner as a fingerprint. 

The method can be used to isolate new 
pathogens (e.g., viruses). DNA from an infected 
25 individual can be treated as the wild type DNA in 
deletion cloning, DNA from uninfected tissue from 
the same individual or from another individual(s) 
(preferably both of the individual's uninfected par- 
ents) is treated as the deletion mutant DNA in 
30 deletion cloning. 

The method can be used to isolate sex deter- 
mining loci, disease resistance determining loci, 
disease causing loci, or drug resistance loci, if 
such loci contain unique restriction fragments 
35 present in one organism but not the other. 

DNA that is absent in one tissue but present in 
another can also be isolated using the method. For 
example, a tumor might arise if both functional 
copies of a recessive oncogene are deleted. Using 
40 the method with tumor DNA and DNA from un- 
affected tissue would allow the recessive oncogene 
to be cloned. 

The present invention also permits one to iden- 
tify the nature of a bacterial, viral, yeast or fungal 
45 pathogen, and thereby provides an aid in diagnos- 
ing the etiology of infectious diseases. As de- 
scribed above, a collection of restriction fragments 
is made from one strain that do not occur in 
another strain. These strains may differ in that one 
so is closely or distantly related to the other. 
Oligonucleotides are synthesized that correspond 
in sequence to each strand of each such fragment 
These oligonucleotides are pooled. DNA from a 
small sample of tissue, or bodily fluid (blood, sa- 
55 liva, urine, stools, etc.) is obtained, combined with 
the pooled oligonucleotides and then amplified 
(preferably using the polymerase chain reaction). If 
the sample contained DNA from the strain lacking 
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the restriction fragments, no bands are apparent on 
an ethidium stained gel after amplification. How- 
ever, if the sample contained DNA from the strain 
with the restriction fragments, one band appears 
corresponding to each fragment. By comparing the 
obtained result with that generated by known 
strains, it is possible to identify the strain. 

The application of genomic subtraction is sub- 
ject to several constraints. Strains with homozyg- 
ous or hemizygous deletions of the locus to be 
cloned must be available and viable. The deletion 
must cover at least one restriction fragment (for 
example, one Sau3A fragment) that is composed 
entirely of non-repetitive sequences. Ideally, the 
two strains should be isogeneic so that sequences 
covered by deletions occurring at irrelevant loci are 
not recovered. 

Cloning DNA that corresponds to sequences 
missing in deletion mutants should be especially 
useful in systems in which mutants can be isolated 
but for which established cloning methods have not 
yet been developed. Because of the ease, effi- 
ciency and speed of genomic subtraction, this 
method may also prove valuable in organisms in 
which other cloning techniques such as transposon 
tagging or chromosome walking are commonly 
useful. For organisms with large genomes or those 
that exhibit polyteny, mutants can be screened 
cytologically for deletions. In the absence of 
cytological evidence, extensive genetic fine struc- 
ture analysis is required to prove that a mutation is 
caused by a deletion. Rather than proving that a 
mutant is caused by a deficiency, genomic sub- 
traction can be performed on a series of allelic 
mutants that have been induced by treatments that 
cause deletions at a high frequency. X-rays (Cote 
et aL, Genetics 112:769-783 (1986); Kelley et aL, 
Genetics 109 :365-377 (1 985)), diepoxybutane 
(Reardon et aL, Genetics 115:323-331 (1987)), and 
chlorambucil (Russell et al., Proc. Natl. ScL USA 
86:3704-3708 (1989)) for example, can induce high 
proportions of deletion mutants. The methods of 
the present invention are designed in a way that 
facilitates processing multiple samples with a mini- 
mum of effort. 

Thus, genomic subtraction can be used to iso- 
late genes that cause human diseases in situations 
in which either patients with deletions can be found 
or cell lines that differ in the presence or absence 
of the locus can be identified. The method can also 
be used to isolate pathogen DNA from infected 
tissue. Virulence genes can be cloned from a 
pathogen if a related non-pathogenic strain exists. 
Clones obtained by applying the method to dis- 
tantly related organisms can be used to probe 
intermediate species allowing the construction of 
phylogenetic trees. Similarly, DNA fragments that 



be used as RFLP markers. If appropriate 
oligonucleotides are synthesized, many markers 
can be evaluated simultaneously by amplification of 
genomic DNA. The technology we have developed 

5 can be applied to most experiments that require 
subtraction or positive selection of nucleic acids. 
The method may be extended to isolate DNA of 
higher plants and animals by accelerating the kinet- 
ics of the DNA reassociation reactions (Wieder et 

w al., Biopolymers 20:1537-1547 (1981)). 

Having now generally described the invention, 
the same will be more readily understood through 
reference to the following examples which are pro- 
vided by way of illustration, and are not intended to 

75 be limiting of the present invention, unless speci- 
fied. 



EXAMPLE 1 

20 



PREPARATION OF DNA 

25 

In order to illustrate the generally applicable 
methods of the present invention, DNA was ex- 
tracted from two strains of Saccharomyces 
cerevisiae: strain T1753 (alpha, ura3 , trpl , his3 , 

30 leu2 , can r , cir°), and TD33.3 (alpha, ura3 , trpl , his3 , 
ieu2 , can r , cir° and Del( lys2 )). The Del( lys2 ) muta- 
tion of strain TD33.3 contains a well characterized 
5 kb deletion. The sequence corresponding to this 
deletion has previously been cloned and character- 

35 ized. The cells were grown under conventional cul- 
turing conditions, harvested and treated to recover 
the genomic DNA. 

DNA was extracted as described by Rogers et 
al. ( Plant Molec. Biol. 5:69-76 (1985). which refer- 

40 ence is herein incorporated by reference). DNA for 
genomic subtraction should be pure (i.e. free of 
DNA from biological contaminants, RNA, 
nucleotides, polysaccharides and proteins). It is 
important that genomic DNA concentrations be 

45 measured carefully. Genomic DNA concentrations 
are preferably determined by comparing band in- 
tensities after gel electrophoresis of the genomic 
DNA samples and several dilutions of bac- 
teriophage 1 DNA of known concentration. Mea- 

50 surements of DNA concentration by absorbance 
spectroscopy are not generally sufficiently accurate 
for this application. For organisms that are rich in 
polysaccharides and/or glycoproteins, CTAB preps 
are useful (Murray et al., Nucleic Acids 8:4321- 

55 4325 (1980), which reference is incorporatecTherein 
by reference). 

For the preparation of yeast DNA, pellets from 
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240 ml 167 mg/ml sorbitol/0.2% (v/v) bME/0.1 
mg/ml zymolase (Seikagaku; 105 units/mg) 100 
mM EDTA. After incubation for 1 hr at 37° C the 
cells were spun for 20 min at 4* C in a Beckman 
JS4.2 rotor at 4200 rpm. The pellets were re- 
suspended in 60 ml 50 mM giucose/25 mM Tris 
HCI pH 8/10 mM EDTA, To the resulting 100 ml 
suspension we added 25 ml of 3.5 M NaCI/0.1 M 
EDTA/0.5 M Tris HCI pH 8. DNA was prepared 
from this suspension according to the method of 
Murray and Thompson (Murray et aL, Nucleic Ac- 
ids 8:4321-4325 (1980), Rogens. S. O. et aL, Plant 
Molec. Biol. 5: 69-76 (1985)). The pellet from the 
final ethanol~(EtOH) precipitation step was re- 
suspended in 10 ml 10 mM Tris HCI pH 8/1 mM 
EDTA (TE). RNase A (Sigma) (Maniatis et aL, Mo- 
lecular Cloning: A Laboratory Manual (Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York) 
(1982)) was added at 100 mg/ml and the solution 
was incubated at 37* C for 1 hr. The solution was 
brought to 0.3 M NaOAc and extracted twice with 
phenol/chloroform (1:1). After adding 2 volumes of 
EtOH, DNA was spooled out of the solution, re- 
suspended in 5 ml TE and then brought to 0.3 M 
Sodium Acetate (NaOAC). DNA was spooled out of 
EtOH solution twice more, washed in 100% EtOH 
and resuspended in TE. A portion of the genomic 
DNA isolated from strain TD33.3 was sheared by 
sonication to an average size of about 3000 bp 
using an Ultrasonics mode W-375 sonicator. The 
sheared DNA was concentrated to 0.5 ml by sec- 
butanol extraction, spun at 13,000 X g for 1 min to 
remove some insoluble material, denatured by boil- 
ing for 3 min. brought to 0.3 M NaOAC, precipitat- 
ed by adding 2 volumes of EtOH, washed in 100% 
EtOH. dried, and resuspended at 1 mg/ml in water. 
Tris buffer was avoided at this stage since it may 
react with photobiotin. 

A Sau3A adaptor was prepared by annealing 
the synthetic oligonucleotides GACACTCTC- 
GAGACATCACCGTCC and GATCGGACGGTGAT- 
GTCTCG AG AGTG . The latter oligonucleotides was 
phosphorylated at the 5 end using T4 poly- 
nucleotide kinase (New England Bioabs) (Maxam et 
aL, Methods Enzymol 65:499-560 (1980)). An equal 
mass of each of the two strands were combined. 
The mixture was heated to 100*C for 2 min in a 
200 ml water bath which was then allowed to cool 
to room temperature. At one end, the resulting 
adaptor has a Sau3A compatible sticky end with a 
5 phosphate and at the other a Sau3A incompati- 
ble 5' overhang. The incompatible overhangs and 
single 5' phosphate insure that only one adaptor 
molecule is ligated onto each end of a Sau3A 
fragment. 

The Bglll fragment from the region deleted in 
strain TD33.3 was subcloned by ligating a Bglll 
digest of Ylp33.3 into pUC13 that had been cut 



with BamHL The deleted fragment was excised 
from this plasmid by cutting with EcoRI and Pstl 
(the Bglll sites had been destroyed during sub- 
cloning). This fragment, referred to as the "5 kb 

5 Bglll fragment" below, was gel purified on a 2% 
low-melting agarose gei (Ausubel et aL, Current 
Protocols in Molecular Biology (Greene Publishing 
Associates~and John Wiley & Sons, New York) 
(1987), which reference is incorporated herein by 

10 reference). 



EXAMPLE 2 



DNA MODIFICATION WITH PHOTOBIOTIN 



20 Sheared DNA (1 ug/ml) in water, or in 10 mM 

tris HCI pH 8/1 mM EDTA (TE) that had been 
boiled for 2 minutes, and chilled in an ice bath, was 
mixed with an equal volume of photobiotin (1 
ttg/ml) (obtained from Vector Laboratories), or, 
25 more preferably, photobiotin acetate (1 ug/ml) 
(obtained from Clontech (Clonetech #5000-2 (5 mg 
in 2.5 mi water to a concentration of 2 ug/ul)) in a 
2 ml polypropylene tube. The open tube was 
placed in a Styrofoam rack floating in an ice bath 
30 10 cm underneath a GE Sunlamp (model RSM/H, 
equipped with a 275 W bulb). After approximately 
fifteen minutes the lamp was turned off and the 
reaction was brought to 100 mM tris HCI pH 9 by 
adding a 1 M stock solution. Unreacted photobiotin 
35 was removed by extracting three, and preferably 
four, times with water saturated n-butanol. After 
ethanol precipitation the modified" DNA was re- 
suspended at 2 mg/ml in TE (10 mM Tris HCI, 1 
mM EDTA, pH 8), or more preferably, EE (EE is 10 
40 mM (Na)Epps, pH 8.25; 1 mM EDTA, pH 8; ad- 
justed to a final pH of 8.0. EE is made as 10 X EE 
using 1 M (Na)Epps, pH 8.25, and 0.5 M EDTA, pH 
8). EPPS (N-(2-hydroxyethyl) piperazine-N'-3-pro- 
panesulonic acid) is a close chemical relative of 
45 HEPES. Its pK a is 8.0 (at 20* C) and the pK a 
changes only slightly with temperature (-0.007 
units/* C). The pH of 1 X EE at 100° C should be 
about 7.4. Keeping the pH neutral is important for 
this procedure in order to avoid depurination of 
50 DNA. This occurs rapidly at acid pH and high 
temperatures. Tris is inappropriate since the pK B 
drops rapidly with increased temperature. 

It is important to maintain a neutral pH during 
extensive high temperature incubations in order to 
55 minimize depurination of DNA (Undahl et aL, Bio- 
chemistry n:310-318 (1972)). EPPS was used be- 
cause its pK a (8.0 at 20* C) changes slowly with 
temperature (-0.007 units/* C. A 1 M stock solution 



21 



EP 0 372 524 A2 



22 



of EPPS pH 8.25 was used to make ai! EPPS 
containing buffers. 



EXAMPLE 3 



DNA REASSOCIATIONS 



Sau3A digested T1753 DNA (0.5 ug), 
biotinylated DNA from strain TD33.3 (10 ug), an 
end-labelled 84 base synthetic oligonucleotide 
(70,000 cpm), and 40 ug yeast tRNA (Sigma) were 
denatured in 2 X EE by boiling for 1 min. Lyophiliz- 
ed yeast tRNA is resuspended in 0.3 M NaOAc. It 
is preferably extracted twice with Phenol/chloroform 
(1:1) and EtOH precipitated, and resuspend such 
that the concentration is 20 ug/ul, prior to use. 

The 84-mer, which does not hybridize with 
yeast DNA, had been end labelled using TV poly- 
nucleotide kinase (New England Biolabs) and a- 
32 P-ATP (New England Nuclear). 

The oligonucleotides molecule was included as 
a marker for the fractions containing DNA that did 
not bind to avidin and was used to estimate overall 
yield. Any radioactive DNA, so long as it is not 
homologous to the modified DNA, can be used to 
monitor the efficiency of the reactions and to pin- 
point the fractions containing DNA during affinity 
chromatography. The ideal tracer should be long 
enough to resemble a Sau3a fragment; single 
stranded (so as to indicate the fate of the singte 
stranded molecules that stick more readily to the 
sides of tubes, etc.; and must not hybridize to the 
genome at 1 M NaCI, 65* C). Labelled DNA 
(100,000 cpm) and tRNA (20 ug) need only be 
added to the first reassociation reaction. 

The mixture was brought to 1 M NaCI/5 mM 
EDTA/ 100 mM Hepes pH 7.5 (1 x SHE) by adding 
4 x SHE. Final DNA concentrations are between 1 
and 2 mg/ml. The mixture was allowed to reas- 
sociate for 21-44 hr at 65 *C. Reactions reached 
C c t values greater than or equal to twenty times the 
C 0 ti/2 value for yeast DNA under these conditions. 

Alternatively, and more preferably, the mixture 
was lyophilized, resuspended in 4 ul 2.5 X EE, and 
then mixed with 1 ul 5 M NaCI. Hybridizations 
were carried out in a 65* C air incubator in 0.5 ml 
centrifuge tubes to maintain constant sample vol- 
umes. After 17 hr/at 65* C, 95 ul of EEN (EEN is 
0.5 M NaCI; 10 mM (Na)Epps, pH 8.25; 1 mM 
EDTA, pH 8; adjusted to a final pH of 8.0) was 
quickly and thoroughly mixed with the sample 
which was then combined with 100 ul of a 5% 
suspension of avidin coated polystyrene beads 



Baxter Healthcare Corporation, Pandex division, 
#31-040-1 (0.7-0.9 u diameter) (Syvanen et al., 
Nucl. Acids Res. 16:11327-11338 (1988)) that had 
been washed in EEN. The sample was incubated at 

5 room temperature for 30 min, added to the cup of 
a microcentrifuge tube filter unit (Millipore, #UFC3, 
OGV 00), and spun at 13,000 X g for 15 sec. The 
beads, which were retained by the filter, were 
washed once with 200 ul EEN. The unbound nu- 

70 cleic acid in the filtrate was precipitated at -70 *C 
following the addition of two volumes of ethanol. 
The resulting pellet was washed with ethanol, dried, 
and resuspended in 5 ul EE. 0.5 ul for subsequent 
analysis. The sample was combined with 10 ug 

15 biotinylated TD33.3 DNA, 20 ug yeast tRNA, and 
brought to 2 X EE. Four more cycles of denatur- 
ation, reassociation, and avidin selection were per- 
formed as above. Aliquots (1/10 of each unbound 
fraction) were saved after each cycle. tRNA (20 ug) 

20 was added as needed to maintain a visible pellet 
after centrifugation. The recovery of DNA not ca- 
pable of hybridization ranged from 80-90% per 
round, as judged by the loss of the labelled 
oligonucleotide. After 5 rounds of subtraction, the 

25 recovery of the labelled oligonucleotide was 44% 
of the theoretical yield. 

If desired, the above protocol can be modified 
to permit recovery of the bound DNA. The beads 
are washed with EEN, the bound DNA eluted with 

30 100 mM NaOH, and the filtrate is neutralized. 

Fragments that have T m 's near or below 65 *C 
in 1 M NaCI cannot hybridize efficiently under the 
hybridization conditions described above, and thus 
will be found in the unbound fraction after each 

35 round of subtraction. To insure that such fragments 
were completely denatured so that they could not 
be capped by adaptors and thus could not be 
amplified, a high temperature incubation step is 
preferably performed prior to the addition of the 

40 adaptors. 

For this step, approximately 20% of the DNA 
that was saved after cycles 1-4 and 10% of the 
unbound fraction from cycle 5 was resuspended in 
50 ul 1 M^NaCI/EE/ 400 ug/ml tRNA. After incuba- 

45 tion at 80* C for 30 min, EE (200 ul) was added, 
the samples were precipitated and washed with 
EtOH, and DNA was resuspended in 5 ul EE. Half 
of the DNA from each sample was ligated to Sau3A 
adaptors (50 ng) using 6 units of T+ DNA ligase 

so (Pharmacia) in 10 ul at 15* C for 7 hours. In 
addition, Sau3A digested T1753 DNA and the clon- 
ed, Sau3A cut, gel purified, Bglll fragment were 
ligated to adaptors. 

55 

EXAMPLE 4 
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BIOTIN-CELLULOSE CHROMATOGRAPHY 



in one embodiment of the present invention, 
the preparation of biotinylated and non-biotinylated 
nucleic acid molecules are incubated in the pres- 
ence of unbound avidin, in order to remove 
biotinyiated molecules from the sample. 

To accomplish such a procedure, avidin (10 ul 
of 1 mg/ml avidin per modified DNA) was 
added to the reassociation reaction and thoroughly 
mixed at room temperature. The reaction was 
brought to 1 x SHE by adding 4 x SHE, and the 
mixture was incubated at room temperature for 30 
min. Biotin celiuiose was prepared by mixing a 
50% slurry (30 ul slurry/ug modified DNA) with 4 x 
SHE (333 ul/ml slurry) and yeast tRNA (100 ug/ml 
slurry). The matrix was then washed twice by mix- 
ing with 50 ml 1 x SHE and spinning for 1 min in a 
clinical centrifuge. The avidin:DNA mixture was ad- 
ded to the washed biotin-ceilulose pellet, shaken at 
room temperature for 30 min and poured into a 
plastic column or disposable pipette tip plugged 
with siliconized glass wool. Alternatively, the DNA 
can be bound by passing the mixture over a biotin- 
ceilulose column directly. The column was washed 
with 1 x SHE and fractions containing radioactivity 
are collected. These fractions are extracted with 
50% phenol/50% chloroform, concentrated two fold 
by sec-butanol extraction, if desired, and then com- 
bined with two volumes of ethanol. After incubation 
for 5 min in a dry ice:ethanol bath and centrifuga- 
tion for 5 min at 15,000 g, the nucleic acid was 
resuspended in 0.5 ml 0.3 M sodium acetate and 
allowed to precipitate as before. The dried pellet 
was resuspended in TE and may be combined with 
fresh biotin modified DNA or amplified. 



EXAMPLE 5 



DNA AMPLIFICATION AND COLONY HYBRID- 
IZATION 



The DNA samples of Example 4 were sub- 
jected to 4 rounds of subtractive hybridizations and 
affinity chromatography. After each round, samples 
were amplified using the polymerase chain reaction 
or other means. Amplified DNA was labeled with 

and used to probe replicas of a yeast genomic 
library. Approximately 10 s yeast genomic colonies 
were blotted to each fitter. After 4 rounds of enrich- 
ment and amplification, additional rounds of enrich- 
ment and amplification did not change the number 
of colonies capable of hybridizing to the amplified 



DNA, or the specific colonies which hybridized to 
the labeled DNA. This result indicates that the 
method was successful in enriching for the desired 
gene sequences (i.e. such that the same amount of 
5 DNA contained fewer species in addition to the 
desired gene sequence, and thus were capable of 
hybridizing to fewer colonies in the genomic li- 
brary. 

In a subsequent experiment, the colonies which 
10 were found to be capable of hybridizing to the 
enriched and amplified labeled DNA was tested for 
its ability to hybridize to the previously cloned 
sequence which was known to correspond to the 
Del(lys2) deletion. All of the colonies which were 
75 capable of hybridizing to the DNA present after the 
fourth round of enrichment and amplification were 
also capable of hybridizing to sequences known to 
correspond to the Del(lys2) deletion. This results 
shows that the method actually resulted in the 
20 enrichment of the desired gene sequence. 



EXAMPLE 6 

25 

PREFERRED METHOD FOR SUBSTRACTION 



ao To demonstrate this embodiment of the inven- 

tion, the above-described yeast strains TD33.3 
(missing a 5 kb Bglll fragment covering the Iys2 
gene), and T1753 were again used to clone the 
Iys2 region. 

35 For this embodiment of the invention, DNA was 

purified and biotinylated as described above. The 
nucleic acid having an insertion, or containing DNA 
corresponding to the deletion whose cloning was 
desired is incubated in the presence of Sau3A (a 

40 10 fold overdigestion is acceptable). Restricted 
DNA was resuspended at 0.1 ug/ul in EE. 
Biotinylated DNA from a source carrying the dele- 
tion and non-biotinylated from a source having 
DNA which corresponds to the deletion were de- 

45 natured. mixed, and permitted to hybridize with 
complementary strands as described above. 

Rather than using free avidin in solution as a 
means for removing the unwanted biotinylated 
DNA, such DNA was removed from the sample by 

so permitting it to bind to polystyrene beads which 
had been previously coated with avidin. Commer- 
cially available avidin-coated beads were treated to 
remove azide or free avidin, which may be asso- 
ciated with such beads. To accomplish such re- 

55 moval, 2 ml 5% (w/v) Avidin beads were cen- 
trifuged for 1 minute in micro-centrifuge. The 
supernatant was recovered and discarded. The pel- 
let (containing the avidin beads) was resuspended 
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in 1 ml EE buffer. The suspension was centrifuged 
and washed two additional times, each using 1 mi 
of EE buffer. The final pellet was resuspended with 
EEN to 1 .8 ml or so (in consideration of the fraction 
of the AP beads which were estimated to have 
been lost in the washings). 

DNA was sonicated in -3 ml EE in 15 ml 
disposable tubes. Typically one would shear a few 
hundred ug of DNA carrying the deletion, prefer- 
ably for 15 seconds using an Ultrasonics model W- 
375 sonicator with a microtip in a continuous (i.e. 
not pulse) mode with an output setting to 5. Such 
treatment resulted in a DNA preparation whose 
DNA had an average length -3 Kb. The average 
length of the DNA is not critical. However, the 
sheared DNA should be sufficiently long that the 
chance of a fragment not reacting with photobiotin 
is negligible. Since about 1 in 100 nucleotides will 
be biotinylated, the fragments should be quite a bit 
larger than 100 bp. 

The DNA was concentrated to 0.5 ml using 
sec -Butanol ("sec-BuOH"). Approximately 2.5 ml of 
sec-BuOH was"added per ml of water. The mixture 
was vortexed for approximately 10 sec, and cen- 
trifuged in a clinical centrifuge for approximately 30 
sec. The upper phase (sec-BuOH) was removed 
and discarded. Optionally, one may re-extract with 
sec- BuOH, using a volume of sec-BuOH equal to 
the current aqueous phase volume. The aqueous 
phase was boiled for 3 minutes, and 50 ug 3 M 
NaOAc was added, and the sample was mixed. 
When using yeast DNA, a white precipitate may 
form when the NaOAc is added after the sec-BuOH 
concentration. This material is believed to be a 
polysaccharide, and may be removed by centrifu- 
gation. 

Approximately 1 ml EtOH was then added to 
the sample, in order to precipitate the DNA. EtOH 
precipitation was accomplished by incubating the 
sample for 5 minutes in a dry ice/EtOH bath. The 
mixture was* thereafter centrifuged for 5 minutes in 
a microcentrifuge, washed in 100% EtOH, and 
dried. 

DNA was resuspended in water at 1 ug/ul. 
Since Tris may react with the photobiotin, it is 
preferred to resuspend in water rather than in a 
Tris buffer (such as TE). 

DNA was biotinylated as follows: 1 volume of 2 
ug/ul photobiotin (in water) was mixed with 1 vol- 
ume of 1 ug/ul sheared DNA (in water) in a micro- 
centrifuge tube. An aliquot (100 ut) of the 
DNA/photobiotin mixture was pipetted into the bot- 
tom of a 2 ml microcentrifuge tubes (with wide 
angle conical bottoms). The tube (cap open) was 
floated in an ice bath 10 cm below a sun lamp as 
described above. After photobiotinylation has oc- 
curred, 0.2 volumes of 1 M Tris pH 9 were added. 



The sample was extracted 4 times with water- 
saturated n-BuOH, and the bottom (aqueous) phase 
was saved" each time. The volume of sample was 
brought to 0.45 ml, and 50 ul 3 M NaOAc was 
5 added to the microcentrifuge tube. The sample was 
ethanol precipitated as described above, and the 
final sample was resuspended in 0.4 volumes of 
2.5 X EE (ie.e at 2.5 ug/Ul). The final sample was 
then ready for use in subtractive hybridization. 

10 

EXAMPLE 7 



75 

SUBTRACTIVE HYBRIDIZATION 



For the first subtractive hybridization, 5 ul of a 
20 0.1 ug/Ul Sau3A digested " + " DNA (containing 
DNA corresponding to the deletion sought tc be 
cloned), 4 ul of 2.5 ug/ul sheared photobiotinylated 
"-" DNA (containing DNA carrying the deietion), 3 
ui 10 X EE, 1 ul labeled tracer DNA (-10 5 cpm of 
25 the previously discussed synthetic 84-mer), and 2 
Ul ug/Ul yeast tRNA. The mixture was boiled for 1 
min, lyophilized, and resuspended in 4 ul water. 1 
Ul 5 M NaCI was added, and the sample was 
mixed and then centrifuged. The sample was then 
30 incubated in a 65* C air incubator to Cot£20 X 

Coti/2- 

The hybridized DNA was incubated with the 
previously prepared avidin coated beads. The hy- 
bridization reaction was brought to a volume of 100 
35 ul with EEN with vortexing. 100 ul of washed 5% 
Avidin coated polystyrene beads in EEN were ad- 
ded to the 100 ul hybridization mixture. The mix- 
ture was then incubated for 30 minutes at room 
temperature. 

40 Unbound DNA was collected by pipetting the 

sample/bead mixture into the cup of a microcen- 
trifuge filter unit (Millipore, #UFC3 OGV 00 is a 
suitable microcentrifuge tube with a removable cup 
containing a 0.2 um PVDF filter). The unit was then 

45 centrifuged for 15 seconds in a microcentrifuge. 
The tube used for the hybridization and the pipette 
tip was then washed with 200 ul EEN and this 
wash solution was added to the cup of the filter 
unit. The unit was then centrifuged for an additional 

so 15 sec (the volume of the filtrate in the filter unit 
tube was -400 ul. Approximately 1 ml EtOH was 
added to the filtrate, and the sample was EtOH 
precipitated as described above. 

In order to perform subsequent subtractive hy- 

55 bridizations, the pellet was resuspended in the 
presence of 4 ul 2.5 ug/ui sheared photobiotinylat- 
ed "-" DNA in 2.5 X EE. If one desires to save a 

mammIa {a- -imr«lifiA<»4tAn than rtallA* chrtlllrf Ko 
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resuspended in 4.4 ul of DNA and 0.4 ul saved by 
diluting it into 5 ul of TE (prior to boiling). If one is 
concerned about having biotinylated genomic DNA 
in the sample, then the pellet should be resuspen- 
ded in TE, an aliquot (e.g. 1/10 ,h ) should be saved, 
and the bulk of the sample lyophilized, and re- 
suspended in 4 ul of 2.5 ug/Ul biotinylated DNA. 

Resuspended sample which was to be used for 
additional subtractive hybridizations was then 
boiled for 1 minute, after which, 1 ul 5 M NaCI was 
added. The sample was then mixed and centrifug- 
ed, and the amount of radioactive tracer deter- 
mined by Cerenkov counting. The sample was then 
incubated in a 65* C air incubator, preferably under 
paraffin oil to minimize any evaporation of solvent, 
to C 0 t£20 X Cti/2- Unbound DNA was collected as 
described above. 

Fragments that have T m 's near or below 65* C 
in 1 M NaCI cannot hybridize efficiently under the 
hybridization conditions used in these experiments 
and thus will be found in the unbound fraction after 
each round of subtraction. To insure that such 
fragments were completely denatured so that they 
could not be capped by adaptors and thus could 
not be amplified, the above-discussed high tem- 
perature incubation step is preferably conducted 
prior to the addition of the adaptors. 

Thus, DNA from wild-type strain T1753 was cut 
with Sau3A. DNA from strain TD33.3 was sheared 
by sonication to average size of 3000 bp. Starting 
with 0.5 ug of wild-type DNA and using 10 ug of 
biotinylated deletion mutant DNA in each round, 5 
cycles of subtraction were performed on the sam- 
ple prepared in accordance with Example 6. After 
each cycle, an aliquot of the unbound DNA onto 
which Sau3A adaptors were ligated was saved. A 
fraction of each aliquot was amplified using the 
polymerase chain reaction. 

For such amplification, a fraction (e.g. 1/1 0th) 
of the unbound DNA was added to a 500 ul tube 
containing: 44 ul water; 10 ul 5 M NaCI; 5 ul 10 X 
EE; and 1 Ul 20 ug/Ul yeast tRNA. The mixture 
was incubated at 80 # C for 30 min in a PCR 
(Polymerase Chain Reaction) machine or water 
bath. At the conclusion of the incubation, 200 Ul EE 
was added to the sample, and the DNA was EtOH 
precipitated in 500 ul EtOH as described above (- 
70 *C, 5 min; centrifuge, 5 min; wash with 100% 
EtOH, dry. After such precipitation, the DNA was 
resuspended in 5 ul EE. 

To amplify the DNA, adapters of the termini 
were added. It is desirable to ligate adaptors onto a 
Sau3A digest of vector DNA. This will serve as a 
positive control in the amplification reaction. It may 
also be useful to ligate adaptors onto the original 
Sau3A digested n + " DNA. 

To add such adapters, 2.5 ul "melted" un- 
bound DNA was mixed with 2 ul 25 ng/ui Sau3A 



adaptor, 1 ui 10 X ligation buffer, 4 ul TE, and 0.5 
ul T* DNA ligase. The reaction mixture was in- 
cubated at 15* C for Z 2 hr. 

5 

EXAMPLE 8 



DNA AMPLIFICATION AND COLONY HYBRID- 
IZATION 



For DNA amplification, DNA capped with 
75 adaptors (1/10 of the ligation reaction) was am- 
plified in a thermal cycler (Ericomp) with the 
GeneAmpTM Kit (Perkin-Elmer Cetus Instruments) 
using 0.5 mg of phosphorylated adaptor strand 
GACACTCTCGAGACATCACCGTCC as a primer. 
20 The primer was phosphorylated to facilitate the 
subsequent cloning of the amplified DNA. Each of 
the 50 cycles included 3 segments: 30 sec at 
93* C, 30 sec at 55' C, and 3 min at 72* C. 0.5 ug 
of phosphorylated adaptor strand is used to prime 
25 the polymerase reaction. It is important to use the 
correct strand (i.e. the strand whose 3' end is part 
of the adaptor's Sau3A site). The strand is 
prefereably phosphorylated to facilitate cloning of 
the amplified DNA. The following controls are use- 
30 ful: - 10 fg " + " DNA cut with Sau3A capped with 
adaptors; - 10 fg vector DNA cut with Sau3A 
capped with adaptors; and a Primer only control 
(no template). 

Amplified DNA (1/10 of the sample) was de- 
35 natured in the presence of adaptor strand GACAC- 
TCTCGAGACATCACCGTCC (0.2 ug) and labelled 
using [a-^PJdCTP and Kienow fragment of DNA 
polymerase I). 

Free nucleotides were removed from the am- 
40 plified DNA by spinning samples through columns 
(Maniatis et al.. Molecular Cloning: A Laboratory 
Manual (Cold Spring Harbor Laboratory, Cold 
Spring Harbor, New York) (1982)) composed of 
extensively washed Sephadex-G25 (Pharmacia). 
45 An electrophoretic analysis of the amplified 

DNA derived from cycles 1-3 was performed using 
a 2% agarose gel containing ethidium bromide 
(EtBr) (0.25 ug/Ul). The products from rounds 4 
and 5 looked identical to those from round 3. The 
so products obtained upon amplification of 10 fg of 
the Sau3A cut cloned Bglll fragment was also de- 
tected. The pattern of bands obtained after 3 cy- 
cles looks nearly identical to the pattern of the 
amplified Sau3A cut, cloned Bglll fragment that is 
55 deleted in strain TD33.3. This indicated that a high 
degree of enrichment was achieved for the frag- 
ments corresponding to the deletion after 3 rounds 
of subtraction. Comparing the amplified Sau3A dt- 
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gest of the cloned deleted fragment to the unam- 
piified digested fragment, it was apparent that sev- 
eral large bands were not amplified efficiently. 
Similarly, comparison of unamplified and amplified 
Sau3A cut T1753 DNA indicated that fragments 
larger than about 700 bp were not efficiently am- 
plified. The fragments in the unampilfied samples 
were not capped with adaptors and thus migrated 
faster than corresponding fragments in the other 
lanes. 

The amplified DNA from rounds 2 and 3 was 
labelled and used to probe replica filters made 
from a plate containing about 10 5 yeast genomic 
plasmid clones. For such colony hybridization, am- 
plified DNA (1/10 of the sample) wad denatured in 
the presence of adaptor strand GACACTCTC- 
GAGACATCACCGTCC (0.2 ug) and labelled using 
[a- 32 P]dCTP and Klenow fragment of DNA poly- 
merase I (Ausubel et aL, Current Protocols in Mo- 
lecular Biology (Greene Publishing Associates and 
John Wiley & Sons, New York) (1987)). Plasmid 
Ylp33.3 containing the cloned yeast Iys2 gene was 
digested with Bglll and Xhoi. The two fragments 
comprising the Bglll fragment that is deleted in 
strain TD33.3 were gel purified and labelled using 
random hexamers as primers (Feinberg et al. f Anal. 
Biochem. 137:266-267 (1984)). Replica fTlteTs "co!v 
taining about 1 X 10 s yeast genomic clones were 
hybridized to the labelled DNA (Hanahan et al., 
Gene 10:63-67 (1980); Grunstein et aL, Proc.TlaFl. 
Acad. Sci. USA 72:3961-3965 (1975)). 

The cloned 5 kb Bglll fragment which is miss- 
ing in strain TD33.3 was labelled and hybridized to 
a replica filter made from the same plate. Genomic 
clones containing sequences that are deleted in 
strain TD33.3 were observed by autoradiogram of 3 
replica filters, as spots which hybridized to the 
labelled 5 kb Bglll deleted fragment. After hy- 
bridization with the probe derived from the third 
cycle, the prominent signals corresponded to spots 
generated by the labelled 5 kb Bglll fragment. 
Thus, 3 rounds of genomic subtraction provided 
sufficient enrichment to identify clones containing 
sequences that are absent in the deletion mutant. 
Most of the colonies that hybridize to the probe 
from round 2 do not contain deleted sequences; 
however, several of the darker signals are superim- 
posable on the spots on the filter that was probed 
with the cloned deleted sequences indicating that 
some enrichment for deleted sequences has oc- 
curred after 2 cycles of subtraction. 

A small fraction of the clones that are pin- 
pointed by the cloned deletion probe did not ap- 
pear as spots on the round 3 filter. Analysis of 
duplicate filters probed with the round 3 probe 
indicate that the spots are missing due to unfaithful 
replica plating. Also, in an independent genomic 

" ' • - ' A -" -.1* iU.i 



hybridized to the cloned deletion probe were also 
pinpointed by the experimental probe. 

Analysis of cloned amplified unbound DNA in- 
dicated average enrichments of approximately 10 

s fold during each of the first three rounds. About 
95% of the clones derived from the fourth round 
hybridized to the deleted sequences. 

Results similar to those reported above were 
obtained in two additional independent genomic 

/o subtraction experiments. In one experiment, 5 
rounds of subtraction were required to obtain re- 
sults comparable to that described above. In the 
other experiment (Examples 4 and 5), DNA was 
incubated with free avidin and bound the samples 

75 to a biotin-cellulose matrix instead of using avidin 
coated beads (Kasher et al., Mol. Cell. Biol. 6:3117- 
3127 (1986)). Enrichments comparable to" those 
reported here were obtained after 4 cycles in the 
latter experiment. 

so Attempts to perform genomic subtraction when 

adaptors were added at the beginning of the ex- 
periment (i.e. after Sau3A digestion of the wild-type 
DNA) have not resulted in enrichments equal to 
those shown here. This may be due to the am- 

25 plification of significant amounts of DNA that re- 
mains unbound during the affinity steps for reasons 
other than that they correspond to a chromosomal 
deletion. 

In summary, using genomic subtraction we effi- 

30 ciently isolated sequences that are absent in a 
deletion that spans 1 /4000 th of the yeast genome. 
Cloning sequences that correspond to deletions 
that represent smaller fractions of a genome should 
be possible simply by performing more rounds of 

35 subtraction. 

While the invention has been described in con- 
nection with specific embodiments thereof, it will 
be understood that it is capable of further modifica- 
tions and this application is intended to cover any 

40 variations, uses, or adaptations of the invention 
following, in general, the principles of the invention 
and including such departures from the present 
disclosure as come within known or customary 
practice within the art to which the invention per- 

45 tains and as may be applied to the essential fea- 
tures hereinbefore set forth as follows in the scope 
of the appended claims. 

so Claims 

1. A method for enriching for a desired gene 
sequence absent in one population of nucleic acid 
molecules but present in another which comprises: 
55 a) mixing a first population of denatured nu- 

cleic acid molecules suspected of possessing said 
desired gene sequence with an excess amount of a 

«<^«^«rl ^Annl«4in n r\f Honati trorl ni m\air> aoirl mnl. 
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ecules suspected of lacking said desired gene se- 
quence, wherein said second population of mol- 
ecules has been labelled with a label, and wherein 
said first population of molecules lacks said label; 

b) permitting the mixed molecules of said 5 
first and second populations of molecules to hy- 
bridize and thereby form double-stranded nucleic 
acid molecules; and 

c) isolating any double-stranded molecules 

of said mixture which are composed solely of re- to 
natured molecules of said first population. 

2. The method of claim 1 wherein said label of 
said labelled second population of molecules is a 
biotin label, or an antigen label. 

3. The method of claim 1 wherein said step (c) 75 
comprises the sub-steps: 

i. incubating said mixed first and second 
populations of molecules with an agent capable of 
binding to said label of said second population of 
molecules; 20 

ii. permitting said agent to bind to said label 
of said second population of molecules; and 

iii. separating any of said second population 
of molecules from said mixture by selectively sepa- 
rating said bound agent, along with its bound, 25 
labelled, molecule of said second population of 
molecules, from said mixture. 

4. The method of claim 3 wherein said label of 
said labelled second population of molecules is a 
biotin label. 30 

5. The method of claim 3 wherein said agent is 
an antibody. 

6. The method of claim 3 wherein said agent is 
avidin. 

7. The method of claim 6 wherein said avidin is as 
in solution. 

8. The method of claim 7 wherein said avidin in 
solution, after binding to said labelled molecule of 
said second population of molecules, is removed 
from solution along with said bound, labelled, mol- ao 
ecule by affinity separation with biotin, or by im- 
munoaffinity separation with an anti-avidin antibody. 

9. The method of claim 6 wherein said avidin is 
bound. 

10. The method of claim 9 wherein said avidin 45 
is bound to a bead or a resin. 



so 
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